Three approaches to supervised learning for compositional data with pairwise logratios

نویسندگان

چکیده

The common approach to compositional data analysis is transform the by means of logratios. Logratios between pairs parts (pairwise logratios) are easiest interpret in many research problems. When number large, some form logratio selection a must, for instance an unsupervised learning method based on stepwise pairwise logratios that explain largest percentage variance dataset. In this article we present three alternative supervised methods select best dependent variable generalized linear model, each geared specific problem. first features unrestricted search, where any can be selected. This has complex interpretation if overlap, but it leads most accurate predictions. second restricts occur only once, which makes corresponding intuitively interpretable. third uses additive logratios, so $K-1$ selected involve exactly $K$ parts. fact searches subcomposition with highest explanatory power. Once identified, researcher's favourite representation may used subsequent analyses, not Our methodology allows or non-compositional covariates forced into models theoretical knowledge, and various stopping criteria available information measures statistical significance Bonferroni correction. We illustration approaches dataset from study predicting Crohn's disease. excels terms predictive power, other two interpretability.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Semi-Supervised Metric Learning Using Pairwise Constraints

Distance metric has an important role in many machine learning algorithms. Recently, metric learning for semi-supervised algorithms has received much attention. For semi-supervised clustering, usually a set of pairwise similarity and dissimilarity constraints is provided as supervisory information. Until now, various metric learning methods utilizing pairwise constraints have been proposed. The...

متن کامل

Feature Learning Based Deep Supervised Hashing with Pairwise Labels

Recent years have witnessed wide application of hashing for large-scale image retrieval. However, most existing hashing methods are based on hand-crafted features which might not be optimally compatible with the hashing procedure. Recently, deep hashing methods have been proposed to perform simultaneous feature learning and hashcode learning with deep neural networks, which have shown better pe...

متن کامل

A Bayesian method for detecting pairwise associations in compositional data

Compositional data consist of vectors of proportions normalized to a constant sum from a basis of unobserved counts. The sum constraint makes inference on correlations between unconstrained features challenging due to the information loss from normalization. However, such correlations are of long-standing interest in fields including ecology. We propose a novel Bayesian framework (BAnOCC: Bayes...

متن کامل

a new approach to credibility premium for zero-inflated poisson models for panel data

هدف اصلی از این تحقیق به دست آوردن و مقایسه حق بیمه باورمندی در مدل های شمارشی گزارش نشده برای داده های طولی می باشد. در این تحقیق حق بیمه های پبش گویی بر اساس توابع ضرر مربع خطا و نمایی محاسبه شده و با هم مقایسه می شود. تمایل به گرفتن پاداش و جایزه یکی از دلایل مهم برای گزارش ندادن تصادفات می باشد و افراد برای استفاده از تخفیف اغلب از گزارش تصادفات با هزینه پائین خودداری می کنند، در این تحقیق ...

15 صفحه اول

Supervised Learning Approaches for Rating Customer Reviews

Social media has become highly popular in recent years that people are expressing their views, thoughts about any product, movie through reviews. Reviews are having a great influence on people and decisions made by them. This has led researchers and market analyzers to analyze the opinions of users in reviews and model their preferences. Sometimes reviews are also scored in terms of satisfactio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Journal of Applied Statistics

سال: 2022

ISSN: ['1360-0532', '0266-4763']

DOI: https://doi.org/10.1080/02664763.2022.2108007